Self-Organizing Maps for Clustering in Document Image Analysis

نویسندگان

  • Simone Marinai
  • Emanuele Marino
  • Giovanni Soda
چکیده

In this chapter, we discuss the use of Self Organizing Maps (SOM) to deal with various tasks in Document Image Analysis. The SOM is a particular type of artificial neural network that computes, during the learning, an unsupervised clustering of the input data arranging the cluster centers in a lattice. After an overview of the previous applications of unsupervised learning in document image analysis, we present our recent work in the field. We describe the use of the SOM at three processing levels: the character clustering, the word clustering, and the layout clustering, with applications to word retrieval, document retrieval and page classification. In order to improve the clustering effectiveness, when dealing with small training sets, we propose an extension of the SOM training algorithm that considers the tangent distance so as to increase the SOM robustness with respect to small transformations of the patterns. Experiments on the use of this extended training algorithm are reported for both character and page layout clustering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gait Based Vertical Ground Reaction Force Analysis for Parkinson’s Disease Diagnosis Using Self Organizing Map

The aim of this work is to use Self Organizing Map (SOM) for clustering of locomotion kinetic characteristics in normal and Parkinson’s disease. The classification and analysis of the kinematic characteristics of human locomotion has been greatly increased by the use of artificial neural networks in recent years. The proposed methodology aims at overcoming the constraints of traditional analysi...

متن کامل

Gait Based Vertical Ground Reaction Force Analysis for Parkinson’s Disease Diagnosis Using Self Organizing Map

The aim of this work is to use Self Organizing Map (SOM) for clustering of locomotion kinetic characteristics in normal and Parkinson’s disease. The classification and analysis of the kinematic characteristics of human locomotion has been greatly increased by the use of artificial neural networks in recent years. The proposed methodology aims at overcoming the constraints of traditional analysi...

متن کامل

Gait Based Vertical Ground Reaction Force Analysis for Parkinson’s Disease Diagnosis Using Self Organizing Map

The aim of this work is to use Self Organizing Map (SOM) for clustering of locomotion kinetic characteristics in normal and Parkinson’s disease. The classification and analysis of the kinematic characteristics of human locomotion has been greatly increased by the use of artificial neural networks in recent years. The proposed methodology aims at overcoming the constraints of traditional analysi...

متن کامل

Gait Based Vertical Ground Reaction Force Analysis for Parkinson’s Disease Diagnosis Using Self Organizing Map

The aim of this work is to use Self Organizing Map (SOM) for clustering of locomotion kinetic characteristics in normal and Parkinson’s disease. The classification and analysis of the kinematic characteristics of human locomotion has been greatly increased by the use of artificial neural networks in recent years. The proposed methodology aims at overcoming the constraints of traditional analysi...

متن کامل

Steel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps

Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008